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Abstract 

The  SuM-to-MlN  transition  that  children  exhibit  when 
learning  to  add  provides  an  ideal  domain  for  studying 
naturally  occurring  discovery  processes.  We  discuss  a 
computational  model  that  acconnts  for  this  transition, 
including  the  appropriate  intermediate  strategies.  In  or¬ 
der  to  account  for  all  of  these  shifts,  the  model  must 
sometimes  learn  without  the  benefit  of  impasses.  Our 
model  smoothly  integrates  impasse-driven  and  impasse- 
free  learning  in  a  single,  simple  learning  mechanism. 

Introduction 

This  paper  discusses  models  for  the  well-known  SUM- 
to-MlN  transition  (Ashcraft,  1982, 1987;  Groen  li  Park- 
man,  1972;  Groen  b.  Resnick,  1977;  Kaye,  Post,  Hall, 
b.  Dineen,  1986;  Siegler  b  Jenkins,  1989;  Svenson, 
1975).  This  is  an  ideal  domain  for  studying  natu¬ 
rally  occurring  discovery  processes  (Siegler  b  Jenk¬ 
ins,  19891.  When  young  children  first  learn  to  add 
two  small  numbers,  they  use  the  so-called  Sum  strat¬ 
egy.  They  create  sets  of  objects  to  represent  each  ad¬ 
dend,  then  count  the  objects  in  the  union  of  the  two 
sets.  For  instance,  in  order  to  solve  2  +  3,  the  child 
says,  “1,  2”  while  raising  two  fingers  on  the  left  hand; 
then  “1,  2,  3”  while  raising  three  fingers  on  the  right 
hand;  then  “1,  2,  3,  4,  5,”  while  counting  all  the  raised 
fingers.  This  strategy  is  called  the  SUM  strategy  be¬ 
cause  its  execution  time  is  proportional  to  the  sum  of 
the  two  addends.  Older  children  use  a  more  efficient 
strategy,  the  MlN  strategy,  whose  execution  time  is 
proportional  to  the  minimum  of  the  two  addends.  In 
following  this  strategy,  the  child  first  announces  the 
value  of  the  larger  addend,  then  counts  on  from  it. 
For  instance,  in  order  to  solve  2  -b  3,  the  child  would 
say  “3”  then  say  “4,5”  while  raising  two  fingers  on  one 
hand. 

Although  the  Sum  strategy  is  taught  in  school,  the 
Min  strategy  appears  to  be  invented  by  the  children 
themselves.  The  best  evidence  for  this  comes  from 
a  longitudinal  study  by  Siegler  and  Jenkins  (1989). 
They  interviewed  8  children  weekly  for  11  weeks,  each 
time  asking  them  to  solve  about  15  orally  presented 
.addition  problems.  After  each  problem,  the  child  was 
asked  how  they  got  their  answer.  The  child  was  also 
told  whether  their  answer  was  correct,  and  was  given 
a  gold  star  if  it  was.  \'idcotapes  were  analyzed  and 
the  child’s  behavior  on  each  problem  w-as  classified  ac- 


This  research  bencfiterl  from  discussions  with  Jeff 
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cording  to  the  strategy  used.  As  far  as  Siegler  and 
Jenkins  could  determine,  the  only  instruction  that  the 
subjects  received  during  this  period  was  their  school’s 
normal  instruction  on  the  SUM  strategy.  Nonetheless, 
7  of  the  8  children  began  to  use  the  Min  strategy. 
Moreover,  they  appear  to  have  discovered  it  during 
the  video-taped  sessions.  The  tapes  make  it  clear  that 
they  received  no  help  from  the  experimenter,  so  the 
Min  strategies  appear  to  have  been  invented  by  the 
subjects  themselves. 

The  issue  addressed  by  this  paper  is  explaining  how 
children  discover  the  Min  strategy.  This  particular 
discovery  presents  a  challenge  for  current  theories  of 
learning.  Although  a  variety  of  learning  methods  have 
been  proposed,  many  of  them  are  triggered  when  the 
problem  solver  reaches  an  impasse,  and  yet  Siegler  and 
Jenkins  found  no  signs  of  impasses  during  the  dis¬ 
covery  of  the  Min  strategy.  The  exact  definition  of 
“impasse”  depends  on  the  problem-solving  architec¬ 
ture,  but  roughly  speaking,  an  impasse  occurs  when¬ 
ever  the  solver  has  a  goal  that  cannot  be  achieved  by 
any  operator  that  is  believed  to  be  relevant  to  the 
task  at  band.  The  essential  idea  of  impasse-driven 
learning  is  to  resolve  the  impasse  somehow,  then  store 
the  resulting  experience  in  such  a  way  that  future  im¬ 
passes  will  be  avoided  or  at  least  handled  more  ef¬ 
ficiently.  Many  systems  use  impasse-driven  learning, 
including  Swale  (Schank,  1986),  Occam  (Pazzani, 
Dyer  b  Flowers,  1986),  Lparsifal  (Berwick,  1985), 
Soar  (Newell,  1990),  Sierra  (VanLehn,  1990),  and 
Cascade  (VanLehn  b  Jones,  in  press). 

Siegler  and  Jenkins  looked  specifically  for  signs  of 
impasses  and  found  none.  In  particular,  they  designed 
some  of  the  problems  to  cause  impasses  by  making  one 
of  the  addends  very  large  (e  g,,  23  +  1).  They  found 
that  “The  specific  problems  on  which  the  children  first 
used  the  MiN  strategy  were  2-1-5,  4 -b  1,  3-1- 1,  1-1-24, 
5  -b  2,  and  4+3.  These  problems  did  not  deviate  from 
the  characteristics  of  the  overall  set  in  any  notable 
way.”(p.  67)  In  fact,  some  of  the  children  had  ear¬ 
lier  successfully  solved  exactly  the  same  problem  that 
they  were  working  on  when  they  discovered  the  MlN 
strategy.  Although  the  impasse  problems  did  cause 
subjects  who  had  already  invented  the  Min  strategy 
to  start  using  it  more  frequently,  the  problems  did  not 
cau«:e  those  who  had  not  invented  the  strategy  to  do 
so. 

Siegler  and  Jenkins  sought  signs  of  impasses  by  e.\- 
amining  solution  times  and  errors  in  the  vicinity  of  the 
discovery  events.  Solution  times  w»re  longer  than  aor- 
inai  tor  liie  problems  where  the  discovery  occurred  and 
for  the  problems  immediately  preceding  the  discovery 
trial.  This  might  suggest  some  kind  of  impasse,  but  er¬ 
ror  rates  indicated  that  the  problems  themselves  were 
not  particularly  difficult.  Siegler  and  Jenkins  suggest 


a  reconciliation  between  their  findings  and  impasse- 
driven  learning  theories: 

Two  types  of  strategy  changes  can  be  distinguished: 
changes  in  which  the  main  difference  between  the 
strategies  is  in  the  answers  themselves,  and  changes 
in  which  the  main  differences  are  not  in  the  answers 
that  are  generated  but  rather  in  the  efficiency  with 
which  aiibwcrs  are  generated  and/or  the  aesthetic 
appeal  of  the  procedures.  The  first  type  of  strategy 
change  may  occur  primarily  as  a  result  of  encoun¬ 
tering  impasses,  but  the  second  may  typically  occur 
for  other  reasons,  fp.  104) 

One  model  in  the  literature  effects  a  strategy  change 
whenever  it  detects  an  opportunity  for  improving  the 
efficiency  of  a  procedure.  HPM  (Neches,  1987)  keeps 
a  complete  trace  of  its  processing,  which  is  constantly 
monitored  by  heuristics  such  as:  “If  a  subprocedure 
produces  an  output,  but  no  other  subprocedure  re¬ 
ceives  that  result  by  the  time  the  overall  procedure 
finishes,  then  modify  the  overall  procedure  to  eliminate 
the  superfluous  computation.”  Neches  demonstrated 
that  this  heuristic  and  two  others  sufficed  for  changing 
the  Sum  strategy  into  the  MlN  strategy. 

Enroute,  two  transitional  strategies  are  necessarily 
produced  by  HPM.  Siegler  and  Jenkins  sought  evi¬ 
dence  for  these  transitional  strategies  in  their  data. 
One  of  the  strategies  occurred  6  times,  all  in  the  proto¬ 
col  of  the  same  subject.  Moreover,  all  these  instances 
occurred  after  the  Min  strategy  was  invented.  The 
second  transitional  strategy  pr^icted  by  Neches  did 
not  appear  at  all.  These  unfulfilled  pr^ictions  cast 
doubt  on  the  HPM  model.  The  model  itself,  as  Neches 
noted,  “assumes  the  relative  accessibility  of  extremely 
detailed  information  about  both  on-going  process  and 
related  past  experiences.  How  can  this  be  reconciled 
with  known  limitations  on  the  ability  to  report  this  in¬ 
formation?”  (p.  213)  Although  HPM  is  com.putation- 
ally  sufficient  to  produce  the  SuM-to-MlN  transition, 
it  makes  dubious  empirical  and  mneumonic  assump¬ 
tions. 

The  objective  of  this  research  has  been  to  gener¬ 
ate  a  computationally  sufficient  account  of  the  SuM- 
to-MlN  transition  that  produces  only  observed  tran¬ 
sitional  strategies  and  makes  plausible  demands  on 
memory.  The  result  is  a  problem  solver  called  GiPS 
(General  Inductive  Problem  Solver).  This  paper  de¬ 
scribes  GlPS  and  its  account  of  the  strategy  shifts  ob¬ 
served  by  Siegler  and  Jenkins  (1989). 

GiPS  is  a  generalized  means-ends  analysis  (MEA) 
problem  solver  (Jones,  1989)  whose  primary  learning 
mechanism  is  based  on  Schlimmer’s  (1987;  Schlimmer 
&  Granger,  1986a,  1986b)  Stagger  system,  which 
uses  a  probabilistic  induction  technique  to  learn  con¬ 
cept  descriptions  from  examples.  Other  systems  have 
combined  inductive  concept  learners  with  problem  solv¬ 
ers  (e.g.,  Langley,  1985;  Mitchell,  Utgoff  &  Banerji, 
1983),  but  they  acquire  only  search-control  knowledge; 
concepts  that  indicate  which  operator  to  select  when 
several  operators  match  the  current  goal.  GiPS  modi¬ 
fies  the  descriptions  of  the  operators  themselves  as  well 
as  the  heuristics  for  selecting  operators.  Both  types  of 
learning  play  crucial  roles  in  making  the  SuM-to-MlN 
transition. 

Following  •>  brief  description  of  GiPS  (see  Jones  A: 
VanLehn,  1991,  for  a  complete  descriptiorn  is  the  main 
section  of  this  paper,  which  presents  the  GiPS  account 
of  the  SUM-to-MlN  (.ransition. 


The  General  Inductive  Problem  Solver 
GiPS’  basic  problem-solving  algorithm  is  a  general¬ 
ize  version  of  MEA  borrowed  from  the  Eureka  sys¬ 
tem  (Jones,  1989).  It  is  based  on  trying  to  achieve  a 
state  change.  The  desired  change  is  represented  by  a 
Transform,  which  is  simply  a  pair  consisting  of  the 
current  state  and  some  goal  conditions.  In  order  to 
achieve  this  transformation,  GiPS  selects  an  operator 
and  attempts  to  apply  it.  If  the  operator’s  precon¬ 
ditions  are  met,  it  is  executed  and  the  current  state 
changes.  If  some  of  the  preconditions  are  not  met,  a 
new  Transform  is  created  with  them  as  the  goals. 
When  this  transformation  is  achieved,  GiPS  returns 
to  the  old  Transform  and  attempts  again  to  apply 
the  operator.  So  far,  this  is  just  the  standard  MEA 
algorithm,  but  GiPS  adds  two  important  differences. 

In  standard  MEA,  operators  are  selected  if  they 
reduce  the  difference  between  the  current  and  goal 
states.  In  GiPS,  selection  is  determined  by  selection 
concepts.  Each  operator  has  a  concept  that  indicates 
when  it  should  be  selected.  If  the  concept  depends 
mostly  on  the  current  state  of  the  TRANSFORM,  then 
the  operator  will  act  like  a  forward-chaining  inference 
rule  and  fire  whenever  the  state  is  appropriate,  regard¬ 
less  of  the  current  goals.  If  the  concept  depends  mostly 
on  the  goals  of  the  TRANSFORM,  then  it  will  act  like  a 
backward-chaining  inference  rule.  Typically,  forward 
and  backward  operators  intermingle  during  problem 
solving,  yielding  a  psychologically  plausible  blend  of 
goal-directed  and  opportunistic  behavior. 

Each  operator  has  a  selection  concept.  The  concept 
is  represented  as  set  of  literals  (predicates  that  mayor 
may  not  be  negated),  and  each  literal  has  two  values 
associated  with  it:  ib  sufficiency  and  its  necessity.  In 
order  to  evaluate  the  worth  of  selecting  an  operator, 
GiPS  matches  the  literals  against  the  current  Trans¬ 
form.  It  determines  the  subset  of  literals  that  match 
(M)  and  fail  to  match  (F),  then  calculates 

Odds{C)  sufficiency{L)  necessity{L), 
L^M  L£F 

where  Odds(C)  is  the  prior  probability  that  the  con¬ 
cept  is  worth  selecting.  This  is  the  same  formula  used 
by  Stagger,  Schlimmer’s  (1987)  propositional  con¬ 
cept  formation  system,  to  estimate  the  probability  that 
a  riven  object  is  an  instance  of  a  particular  concept. 
When  all  the  operators  have  been  matched  and  their 
worth  has  been  calculated,  GiPS  chooses  the  one  with 
the  highest  rating. 

GiPS  adjusts  its  selection  concepts  on  the  basis  of  its 
successes  and  failures  while  solving  problems.  When 
a  problem  is  finally  solved,  for  each  operator  along 
the  solution  path,  GiPS  adjusts  the  sufficiency  and  ne¬ 
cessity  values  so  that  the  operator  will  be  rated  even 
higher  the  next  time  a  similar  Transform  occurs. 
For  each  operator  that  initiated  a  failure  path,  GiPS 
adjusts  the  values  in  its  selection  concept  so  that  it 
will  receive  a  lower  value  next  time.  In  order  to  learn, 
GiPS  must  store  the  solution  path  and  every  opera¬ 
tor  that  led  off  it.  However,  as  soon  a.s  the  problem 
IS  nnisne>l  and  the  updating  is  con.^j!  ted,  this  iufor- 
mation  can  be  forgotten.  HPM  (Neches,  1987)  must 
store  the  whole  search  tree,  not  just  the  solution  path, 
for  an  indefinite  period.  Holland,  Holyoak,  Nisbett, 
and  Thagard  (1986)  describe  a  technique,  called  the 
bucket  brigade  algorithm,  that  achieves  the  same  kind 


of  update  as  GiPS  without  storing  any  of  the  solution 
path. 

The  mechanism  described  so  far  can  learn  search- 
control  knowledge,  as  do  many  other  machine  learning 
systems.  In  order  to  make  the  SUM-to-MlN  transition, 
the  system  must  modify  the  preconditions  of  operators 
as  well.  In  standard  MEA,  after  an  operator  has  been 
selected,  its  preconditions  are  match^  to  the  current 
state.  If  all  of  them  match,  the  operator  is  executed.  If 
some  do  not  match,  they  become  subgoals.  Thus,  pre¬ 
conditions  determine  the  goal  structure  of  the  problem 
solving.  GiPS  takes  an  indirect  approach  to  adjusting 
preconditions.  In  addition  to  the  selection  concept, 
GiPS  provides  a  second  concept,  called  the  execution 
concept,  which  serves  as  a  repository  for  its  experience 
in  attempting  to  execute  the  operator.  When  valura 
in  the  execution  concept  cross  a  threshold,  appropri¬ 
ate  modifications  are  made  to  the  preconditions.  Thus, 
GlPS  only  modifies  preconditions  when  warranted  by 
a  great  deal  of  experience.  The  next  two  paragraphs 
describe  exactly  how  execution  concepts  are  used. 

When  an  operator  is  selected,  the  system  matchm 
the  execution  concept  to  the  current  TRANSFORM  in 
the  same  way  that  selection  concepts  are  matched.  If 
the  calculation  returns  a  value  greater  than  1,  GiPS 
attempts  to  execute  the  operator.  If  the  value  is  less 
than  1,  GiPS  follows  the  standard  MEA  practice  of 
matching  the  preconditions  of  the  operator  to  the  cur¬ 
rent  state  and  setting  up  subgoals  for  the  unmatched 
ones. 

When  GiPS  attempts  to  execute  an  operator,  it  has 
no  arms  and  eyes  so  it  cannot  tell  if  its  attempt  suc¬ 
ceeds,  thus  it  asks  the  user.  If  an  operator  succeeds, 
the  values  in  the  execution  concept  are  updated  ap¬ 
propriately  and  the  new  state  becomes  current.  If  an 
operator  fails,  the  values  are  updated  and  the  system 
follows  the  standard  MEA  routine  of  matching  precon¬ 
ditions  and  setting  up  subgoals. 

When  the  sufficiency  value  for  a  literal  in  the  execu¬ 
tion  concept  crosses  a  threshold,  that  literal  is  added 
to  the  operator’s  preconditions.  Thus,  GlPS  only  adds 
a  literal  when  that  literal  has  been  found  time  and 
time  again  to  be  present  when  the  operator  executes. 
However,  GlPS  is  quick  to  remove  a  literal  from  the 
preconditions  if  it  ever  finds  that  the  literal  is  not  ac¬ 
tually  necessary  for  the  execution  of  the  operator.  If 
the  system  successfully  executes  an  operator  and  not 
all  of  the  preconditions  are  satisfied,  it  removes  the 
unmatched  literals.  There  is  no  sense  continuing  to 
subgoal  for  an  unmatched  literal  if  that  literal  is  not 
in  fact  necessary  for  executing  the  operator. 

Representation  of  the  Addition  Domain 

GlPS  describes  the  world  as  a  set  of  relations  between 
objects.  In  the  addition  domain,  these  objects  and  re¬ 
lations  include  the  numbers  that  are  part  of  the  prob¬ 
lem,  the  state  of  the  problem  solver’s  “hands”  while  it 
is  adding,  and  the  value  of  a  counter  that  the  problem 
solver  keeps  “in  its  head.” 

GlPS  requires  16  operators  to  represent  the  addition 
domain.  There  are  two  particular  operators,  which  we 
refer  to  as  the  End-Count  operators,  that  are  in¬ 
volved  in  most  of  the  strategy  shifts.  For  future  refer¬ 
ence.  the  series  of  preconditions  that  the  Left-End- 
CoUNT  operator  acquires  appears  in  Table  1.  The  16 
operators’  selection  concepts  were  initialized  so  that 
the  system  generates  the  Sum  strategy.  The  literals 
of  each  operator’s  selection  concept  were  the  precon¬ 


ditions  and  the  goals  that  the  operator  could  satisfy. 
The  necessity  and  sufficiency  of  these  literals  were  set 
so  that  they  would  be  retrieved  in  either  a  backward¬ 
chaining  or  forward-chaining  fashion,  depending  on  the 
role  of  the  operator  in  the  domain. 

Table  1.  A  series  of  preconditions  for  Left-End- 
COUNT. 


SUM  strategy  (a): 

RaisingCLefthand) 

Counting(Lslthaiid) 

As8igned(Lafthand,=Valus) 

Couzitar-value(=Valus) 


SUM  strategy  (b): 

Raising(Lsfthand) 

Connting(Lafthand) 

As s i gnad ( La f t hand , s Val ua } 
Conntar- valua ( =Valna ) 

Rai 8  ad-f ingars (Lai thand ,= Valua ) 


SHORTCUT  SUM  strategy  (c): 
Raising(Lalthand) 
Counting(Lalthand) 

As 8 ignad ( Lai thand ,= Value ) 

Rai 8 ed-1 ingars ( Lai thand ,= Value ) 


FIRST  strategy  (d): 
RaisingCLalthand) 
Counting(Lelthand) 
issignadCLalthand, sVaiua) 


Strategy  Acquisition  in  the  Addition 
lomam 

This  section  presents  GiPS’  behavior  through  a  series 
of  different  strategies  for  adding  numbers.  These  strat¬ 
egy  shifts  arise  from  the  learning  algorithm  incorpo¬ 
rated  into  the  system,  and  they  correspond  well  with 
the  shifts  observed  by  Siegler  and  Jenkins.  Siegler  and 
Jenkins  classified  their  subjects’  behavior  into  8  strate¬ 
gies,  of  which  4  were  based  on  counting  (the  others  in¬ 
volved  various  kinds  of  recognition  and  guessing).  In 
this  section,  we  describe  each  of  the  4  counting  strate¬ 
gies  in  the  order  in  which  they  generally  appear.  How¬ 
ever,  it  is  important  to  note  that  children  always  in¬ 
termingle  their  strategies,  sometimes  even  on  a  trial 
by  trial  basis.  We  will  discuss  the  issue  of  strategy 
variability  in  the  next  section. 

The  Sum  Strategy 

GlPS’  initial  strategy  for  addition  is  the  SUM  strategy. 
The  first  thing  the  system  does  is  assign  an  addend  to 
each  hand.  For  example,  when  adding  2  and  3,  the  sys¬ 
tem  may  2issign  the  number  2  to  the  left  hand  and  the 
number  3  to  the  right  hand.  However,  in  this  strategy 
the  order  of  the  addends  does  not  make  a  difference, 
so  it  could  just  as  easily  have  switched  them. 

Next,  the  system  begins  its  procedure  of  counting 
out  a  set  of  fingers  on  each  hand.  To  accomplish  this 
task  the  End-Count  operators  initially  use  a  counter 
to  determine  when  a  hand  is  finished  being  counted 
out.  For  example,  the  preconditions  of  Left-End- 
Count  demand  that  the  system  be  raising  fingers  on 
the  left  hand,  and  that  the  value  of  the  counter  be 


equal  to  the  value  of  the  left-hand  addend.  These  pre¬ 
conditions  are  set  up  as  subgoais,  causing  the  selec¬ 
tion  of  the  Start-Raise  and  Start-Count  opera¬ 
tors,  which  initialize  the  forward-chaining  proc^ure 
of  raising  and  counting  fingers  one  at  a  time.  These 
operators  execute  alternately  until  Left-END-Count 
can  execute,  when  the  correct  number  of  fingers  have 
been  counted  on  the  left  band. 

After  the  left  hand  has  been  counted,  the  Clobber- 
COUNTER  operator  immediately  executes.  This  oper¬ 
ator  executes  when  all  the  fingers  of  a  hand  have  been 
raised  along  with  a  running  count.  Its  effects  are  to 
zero  the  value  c*"  the  counter  to  prepare  it  for  the  next 
hand,  and  to  mark  the  current  hand  as  uncounted,  be¬ 
cause  the  counter’s  value  has  been  changed.  This  en¬ 
tire  procedure  then  repeats  with  the  right  hand. 

After  both  hands  have  been  counted,  DETERMINE- 
Answer  checks  whether  it  can  execute.  It  can  only 
execute  if  both  hands  are  marked  as  counted,  but  ex¬ 
ecution  of  Clobber-Counter  has  caused  this  to  be 
false.  Therefore,  the  system  again  attempts  to  count 
up  fingers  on  each  hand,  this  time  marking  fingers  that 
are  already  raised.  For  this  procedure  no  Clobber- 
Counter  is  necessary,  because  the  number  of  raised 
fingers  (rather  than  the  value  of  the  counter)  is  used 
to  terminate  the  count  for  each  hand.  Finally,  after 
each  hand  has  been  counted  for  the  second  time,  GiPS 
announces  the  answer. 

As  the  system  repeatedly  solves  addition  problems, 
it  continuously  updates  the  execution  concepts  for  the 
End-Count  operators.  After  a  while,  the  concept  en¬ 
codes  several  regularities  that  are  always  true  when 
these  operators  execute.  For  example,  there  are  al¬ 
ways  two  addends  in  the  problem  description,  and  the 
number  of  “marked”  fingers  is  always  zero.  Most  im¬ 
portantly,  however,  the  concept  encodes  the  number 
of  raised  fingers  is  always  equal  to  the  counter  value 
(which  in  turn  is  equal  to  the  goal  value  for  counting 
an  addend).  Thus,  these  literals  eventually  get  added 
into  the  preconditions  for  the  End-Count  operators 
(see  Table  1(b)).  This  action  alone  does  not  change 
the  system’s  outward  behavior,  but  it  proves  impor¬ 
tant  for  later  strategies. 

The  Shortcut  Sum  Strategy 
After  new  preconditions  have  been  added  and  a  num¬ 
ber  of  addition  problems  have  been  solved,  the  new 
literals  in  the  system’s  execution  concepts  for  Left- 
End-Count  and  Right-End-Count  become  strong 
enough  that  GiPS  attempts  to  execute  the  operators 
earlier  than  usual.  At  some  point,  it  thinks  that  the 
operators  should  execute  when  the  number  of  fingers 
raised  on  a  hand  is  equal  to  the  goal  value  even  though 
the  system  has  not  yet  incremented  its  count  for  the 
last  finger.  It  turns  out  that  the  system  can  success¬ 
fully  solve  the  addition  problem  even  if  it  executes  this 
operator  prematurely,  so  it  deletes  the  condition  that 
the  current  counter  value  must  be  equal  to  the  goal 
value  in  the  preconditions  of  the  End-Count  opera¬ 
tors  (see  Table  1(c)). 

This  change  has  a  direct  effect  on  GiPS’  behavior. 
When  attempting  to  apply  Left-End-Count,  the 
value  of  the  counter  no  longer  appears  in  the  precon¬ 
ditions,  so  it  is  not  posted  as  a  subgoal.  This  means 
that  the  Start-Count  operator  is  no  longer  selected. 
Thus,  a  running  count  is  still  kept  while  raising  fingers, 
but  the  counter  is  not  marked  for  use  as  the  termina¬ 
tion  criterion.  This  means  that  Clobber-Counter 


will  not  fire,  and  that  leads  to  two  changes  in  strategy. 
First,  the  counter  is  not  reset  to  zero  after  counting  the 
left  hand,  and  counting  continues  from  the  left  hand’s 
final  value.  Second,  the  hands  are  not  marked  as  un¬ 
counted,  so  there  is  no  need  to  count  up  the  raised 
fingers  again  after  the  two  hands  have  initially  been 
counted.  This  behavior  corresponds  to  the  Shortcut 
Sum  strategy,  which  was  invented  by  all  8  of  Siegler 
and  Jenkins’  subjects. 

The  Shortcut  Min  Strategy 
The  next  shift  leads  to  an  intermediate  strategy  be¬ 
tween  Shortcut  Sum  and  Min,  which  we  call  Short¬ 
cut  Min.  Although  Siegler  and  Jenkins  do  not  classify 
Shortcut  Min  as  a  distinct  strategy  from  Shortcut 
Sum,  they  do  note  (p.  119)  that  some  of  their  subjects 
begin  to  switch  addends  during  Shortcut  Sum  so 
that  they  start  counting  with  the  larger  addend  on  the 
left  hand,  rather  than  just  picking  whichever  addend 
appears  first  in  the  problem.  GiPS  can  also  account 
for  this  behavior. 

An  important  feature  of  the  Shortcut  Sum  strat¬ 
egy  is  that  the  problem  solver’s  counter  value  is  not 
equal  to  the  number  of  fingers  being  raised  on  the  right 
hand  (i.e.,  the  second  hand).  We  hypothesize  that  this 
causes  interference  and  subsequent  failure.  Such  inter¬ 
ference  would  not  occur  with  the  left  hand,  because  the 
number  of  raised  fingers  in  the  Shortcut  Sum  strat¬ 
egy  is  always  equal  to  the  value  of  the  counter  for  that 
hand.  We  simulated  interference  between  the  value  of 
the  counter  and  the  number  of  fingers  raised  on  the 
right  hand  by  causing  GiPS  to  fail  sometimes  during 
the  Shortcut  Sum  strategy  when  it  decided  to  count 
the  larger  addend  on  its  right  hand.  This  caused  the 
system  to  update  the  selection  concept  for  the  opera¬ 
tor  that  initially  assigns  an  addend  to  each  band,  so 
that  it  would  prefer  to  count  the  smaller  addend  on 
the  right  hand. 

The  Min  Strategy 

The  final  strategy  shift  occurs  in  a  similar  manner  to 
the  shift  from  Sum  to  Shortcut  Sum.  At  this  point, 
GiPS  has  attempted  to  execute  the  End-Count  op¬ 
erators  at  various  times  and  has  been  given  feedback 
each  time  2is  to  whether  it  would  be  ame  to  solve  the 
current  problem  if  it  executed  the  operator  at  that 
time.  Thus,  it  is  slowly  learning  a  “good"  concept  for 
when  the  End-Count  operators  are  executable.  One 
of  the  things  that  proves  to  be  true  every  time  these 
operators  are  executed  is  that  the  goal  \'alue  for  count¬ 
ing  out  a  hand  is  equal  to  the  addend  assigned  to  that 
hand. 

Eventually,  the  system  attempts  to  fire  the  Left- 
End-Count  operator  without  having  raised  any  fin¬ 
gers  at  all.  When  it  succeeds  by  doing  this,  it  deletes 
the  precondition  that  the  number  of  fingers  raised  on 
the  hand  be  equal  to  the  goal  value  (see  Table  1(d)). 
The  system  has  learned  that  it  can  simply  start  count¬ 
ing  from  the  goal  value  for  the  left  hand  rather  than 
starting  from  zero.  GiPS  also  attempts  to  execute  the 
Right-End-Count  operator  early,  but  this  leads  to 
failure.  Thus,  the  system  begins  to  exhibit  the  Min 
strategy,  in  which  the  largest  number  (the  left-hand 
number)  is  simply  announced  and  used  to  continue 
counting  the  smaller  number  as  in  the  Shortcut  Min 
strategy. 


The  First  Strategy 

The  only  other  counting  strategy  found  by  Siegler  and 
Jenkins  is  the  First  strategy.  It  was  used  on  only 
6  trials,  all  by  the  same  subject.  First  is  similar 
to  the  Min  strategy,  except  that  it  does  not  assign 
the  larger  addend  to  the  left  hand.  Rather,  it  starts 
with  whichever  addend  is  presented  first,  and  contin¬ 
ues  counting  with  the  second.  In  GiPS,  this  strate^ 
follows  from  the  Shortcut  Sum  strategy  when  the 
system  does  not  learn  about  ordering  the  addends. 
While  using  the  First  strategy,  the  system  can  still 
eventually  generate  the  Min  strategy  through  the  same 
type  of  failure-driven  learning  that  leads  from  SHORT¬ 
CUT  Sum  to  Shortcut  Min. 

Summary  and  Discussion 

Both  the  Sum  strategy  and  the  Min  strategy  have 
three  main  subgoals:  to  represent  one  addend,  to  rep¬ 
resent  the  the  other  addend,  and  to  count  the  union 
of  the  representations.  The  SUM-to-MiN  transition 
involves  three  independent  modifications  to  the  SUM 
strategy:  (1)  The  subgoal  of  representing  one  addend 
changes  from  explicitly  constructing  a  set  of  objects  to 
simply  saying  the  addend.  (2)  The  order  of  addends 
is  made  conditional  on  their  size  so  that  the  larger 
addend  is  represented  by  the  easier  process.  (3)  The 
process  of  representing  the  other  addend  is  run  in  par¬ 
allel  with  counting  up  the  union.  In  the  SUM  strategy, 
representing  the  other  addend  is  finished  before  count¬ 
ing  up  the  union  begins. 

The  GiPS  account  for  each  of  these  transitions  is  as 
follows.  The  first  transition  is  caused  by  correlational 
laming  of  preconditions.  GiPS  keeps  track  of  which 
literals  in  the  situation  are  correlated  with  the  final 
achievement  of  the  goal  of  representing  the  first  ad¬ 
dend.  Eventually,  it  considers  these  correlated  literals 
to  be  just  as  essential  as  the  originally  specified  pre¬ 
conditions.  It  eventually  discovers  that  the  originally 
specified  preconditions  can  be  ignored  as  long  as  the 
correlated  literals  are  achieved. 

The  second  transition  is  caused  by  normal  failure- 
driven  learning.  The  system  uses  two  apparently  equiv¬ 
alent  methods,  but  persistent  errors  in  one  of  them 
causes  the  other  one  to  eventually  dominate. 

The  third  transition  again  involves  a  correlational 
type  of  learning.  The  Count  operator  is  responsible 
for  incrementing  the  oral  counter.  Initially,  it  is  se¬ 
lected  only  when  the  subgoal  of  counting-up  an  addend 
is  present.  Eventually,  correlated  relations  that  are 
resent  in  the  current  state  (i.e.,  that  a  finger  has  just 
een  raised)  come  to  dominate  the  selection  concept, 
and  the  operator  becomes  a  forward-chaining  opera¬ 
tor.  Basically,  the  person  has  developed  the  habit  of 
counting  whenever  they  raise  a  finger  even  if  that  count 
doesn’t  serve  any  direct  purpose.  Although  GiPS  could 
learn  this  habit,  we  actually  gave  Count  a  forward¬ 
chaining  selection  concent  in  our  experiments  in  or¬ 
der  to  save  time.  Given  this  habit,  it  serendipitously 
achieves  the  goal  of  counting  the  union  even  when  the 
counter  is  no  longer  used  to  represent  the  second  ad¬ 
dend. 

Although  this  summary  leaves  out  some  crucial  de¬ 
tails,  it  makes  it  clear  that  correlational  learning  is  cru¬ 
cial  to  the  GiPS  account  for  the  first  and  third  transi¬ 
tions.  Ordinary  failure  driven  learning  can  handle  the 
second. 

Our  analysis  with  GiPS  helps  clarify  several  impor¬ 


tant,  general  issues  about  strategy  change.  Siegler  and 
Jenkins  observe  that,  “Not  one  child  adopted  a  strat¬ 
egy  that  could  be  classified  as  indicating  a  lack  of  un¬ 
derstanding  of  the  goals  of  addition.”  (p.  107)  In  this 
respect,  the  subjects  are  similar  to  those  of  Gelman 
and  Gallistel  (1978)  who  found  that  very  young  chil¬ 
dren  would  invent  correct  strategies  for  counting  a  set 
of  objects  even  when  unusual  constraints  were  placed 
on  them  to  thwart  their  normal  strategy. 

The  initial  knowledge  given  to  GiPS  does  not  include 
any  explicit  principles  of  addition  or  counting.  As  far 
as  it  is  concerned,  the  Sum  strategy  is  just  a  song  that 
has  to  be  sung  the  right  way.  How  then  does  it  avoid 
developing  bad  strategies?  In  the  Siegler  and  Jenkins 
study,  students  were  told  after  each  trial  whether  they 
got  the  problem  right.  This  kind  of  feedback  is  crucial 
to  GiPS’  learning.  GiPS  occasionally  attempts  to  ex¬ 
ecute  an  operator  in  situations  that  would  produce  a 
wrong  answer.  If  it  were  not  told  that  the  execution 
was  wrong,  it  would  develop  wrong  strategies.  This 
demonstrates  that  an  innate  understanding  of  addi¬ 
tion  is  not  necessary  for  a  computationally  sufficient 
account  of  the  observed  competence. 

A  common  misconception  about  discovery  is  that  a 
newly  discovered  strategy  or  concept  instantly  and  to¬ 
tally  supplants  its  predecessor.  In  all  protocol-based 
studies  of  discovery  (e.g.,  Kuhn,  Amsel,  k  O’Laughlin, 
1988;  Siegler  k  Jenkins,  1989;  VanLehn,  in  press),  the 
transition  between  the  old  strategy  and  the  new  one 
is  gradual.  We  have  not  tried  to  model  the  gradual 
transition  to  the  use  of  the  Min  strategy  with  GiPS 
because  doing  it  right  would  require  implementing  sev¬ 
eral  memory-based  strategies.  However,  it  is  clear  that 
the  probabilistic  nature  of  GlPS’  selection  and  execu¬ 
tion  concepts  would  tend  to  predict  a  gradual  transi¬ 
tion. 

Starting  in  the  eighth  week  of  the  study,  Siegler  and 
Jenkins  began  including  “impasse  problems,”  such  as 
2-f23.  They  had  hoped  that  these  would  encourage 
discovery  of  the  Min  strategy,  but  they  did  not,  for  no 
child  first  used  the  MlN  strate^  on  an  impasse  prob¬ 
lem.  However,  children  who  had  already  discovered 
the  Min  strategy  began  to  use  it  much  more  fr^uently 
on  the  impasse  problems  and  even  on  the  non-impasse 
problems  that  followed  the  eighth  week.  GiPS  would 
tend  to  do  the  same  thing  if  it  were  jiven  impasse 
problems.  The  larger  addend  would  invite  errors  dur¬ 
ing  the  Shortcut  Sum  and  First  strategies,  which 
would  lower  the  values  of  their  selection  concepts.  The 
inclusion  of  impasse  problems  would  not  effect  the  er¬ 
ror  rate  of  the  MlN  strategy,  so  it  would  gradually 
become  the  preferred  strategy  for  all  counting  trials. 

Siegler  and  Jenkins  noticed  that  some  children  were 
consciously  aware  that  they  had  invented  a  new  strat¬ 
egy'  in  that  they  could  explain  it  on  the  first  trial  where 
they  used  it,  and  some  even  recognized  that  it  was  a 
“smart  answer,”  in  the  words  of  one  child.  Other  chil¬ 
dren  denied  using  the  .MlN  strategy  even  when  the 
videotape  showed  that  they  had  used  it.  Siegler  and 
Jenkins  divided  children  into  those  who  seemed  con¬ 
scious  of  the  strategy  and  those  who  did  not,  and  mea¬ 
sured  the  frequency  of  their  subsequent  usage  of  the 
Min  strategy.  The  high  awareness  group  used  the  Min 
strategy  on  about  60%  of  the  trials  where  they  used 
any  counting  strategy.  The  low  awareness  group  used 
the  Min  strategy  on  less  than  10%  of  the  trials.  This 
suggests  that  being  aware  of  a  newly  discovered  strat¬ 
egy  facilitates  subsequent  usage  of  it. 


This  finding  cannot  be  modeled  by  GiPS  because 
GiPS  has  no  way  to  distinguish  a  strategy  that  can  be 
explained  from  one  that  is  inaccessible  to  conscious¬ 
ness.  However,  the  finding  could  probably  be  modeled 
by  combining  GiPS  with  a  symbolic  example-learning 
system  such  as  Cascade  (VanLehn  &  Jones,  in  press; 
VanLehn,  Jones,  k  Chi,  1991).  In  the  new  system, 
GiPS  would  discover  a  strategy  and  store  a  trace  of 
the  strategy’s  actions  in  memory.  This  trace  would 
be  used  as  an  example  to  be  explained  by  the  sec¬ 
ond  system.  If  enough  of  the  trace  can  be  recalled  for 
the  explanation  to  succeed,  it  annotates  the  steps  in 
the  trace  and  perhaps  the  operators  whose  executions 
produced  the  steps.  These  elaborations  would  make  it 
easier  to  retrieve  the  modified  operators  from  memory, 
and  perhaps  help  in  assigning  credit  and  blame,  thus 
speeding  the  adjustment  of  the  preconditions,  selec¬ 
tion,  and  execution  concepts.  These  influences  would 
increase  the  usage  of  the  new  strategy  on  subsequent 
problems. 

To  summarize,  GiPS  achieves  its  main  research  ob¬ 
jective,  providing  a  computational  account  of  the  sev¬ 
eral  strategy  shifts  observed  during  the  SuM-to-MlN 
transition.  It  uses  plausible  local  processes,  rather 
than  global  optimization  techniques  as  required  by  the 
HPM  system.  In  addition,  GiPS  uses  modest  amounts 
of  storage,  in  contrast  to  HPM,  which  stores  com¬ 
plete  solution  traces  for  indefinite  periods.  Most  im¬ 
portantly,  GiPS  produces  all  and  only  the  transitional 
strategies  observed  in  the  Siegler  and  Jenkins  study. 
The  GiPS  analysis  solves  &  number  of  puzzles  raised  by 
the  Siegler  and  Jenkins  study.  These  include  the  abil¬ 
ity  to  make  significant  strategy  shifts  without  impasse- 
driven  learning,  and  to  avoid  inventing  bad  strategies 
without  assuming  innate  knowledge  of  the  principles  of 
addition.  Thus,  GiPS  provides  a  plausible,  computa¬ 
tionally  sufficient  account  of  the  discovery  of  the  MlN 
strategy.  However,  Siegler  and  Jenkins  produced  a  sec¬ 
ond  set  of  findings  on  the  gradual  increase  in  usage  of 
the  newly  discovered  strategy.  We  have  not  yet  tried 
to  model  these  findings,  but  GiPS  seems  to  provide  an 
appropriate  framework  for  doing  so. 
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